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^ ! Abstract 

O ; We present a perfect simulation algorithm for stationary processes mdexed by Z, with 

^Q . summable memory decay. Depending on the decay, we construct the process on finite or semi- 

^^ [ infinite intervals, explicitly from an i.i.d. uniform sequence. Even though the process has infinite 

r~| ■ memory, its value at time depends only on a finite, but random, number of these uniform 



variables. The algorithm is based on a recent regenerative construction of these measures by 
Ferrari, Maass, Martinez and Ney. As applications, we discuss the perfect simulation of binary 
autoregressions and Markov chains on the unit interval. 
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1 Introduction 

In this paper we consider processes with transition probabihties that depend on the whole past 
history, i.e. processes with long memory. When this dependence decays fast enough with time, 
we exhibit a regenerative construction which, besides yielding an explicit proof of existence and 
uniqueness of the process, can be transcribed into a perfect simulation scheme. 

Processes with long memory have a long history. They were first studied by Onicescu and 
Mihoc (1935) under the label chains with complete connections {chaines a liaisons completes). 
Harris (1955) proposed the somehow less used name of chains of infinite order. Doeblin and 
Fortet (1937) proved the first results on speed of convergence towards the invariant measure. 
Harris (1955) called these processes chains of infinite order, and extended results on existence 
and uniqueness. The chains appeared also as part of the formalism introduced by Keane (1971 
and 1976) to study of subshifts of finite type (or covering transformations). In this theory the 
transition probabilities are called g-functions and the invariant measures g-measures. 

The theory of long-memory processes has found applications in the study of urn schemes 
(Onicescu and Mihoc, 1935b), continued-fraction expansions (Doeblin, 1940; losifescu 1978 
and references therein), learning processes (see, for instance, losifescu and Theodorescu, 1969; 
Norman, 1972 and 1974), models of gene population (Norman, 1975), image coding (Barnsley, 
Demko, Elton and Gerinomo, 1988), automata theory (Herkenrath and Theodorescu, 1978), 
partially observed — or "grouped" — random chains (Harris, 1955; Blackwell, 1957; Kaijser, 
1975; Pruscha and Theodorescu, 1977; Elton and Piccioni, 1992) and products of random 
matrices (Kaijser, 1981). For further references we refer the reader to Kaijser (1981 and 1994) 
[from which most of the material of the previous review paragraphs is taken] and to losifescu 
and Grigorescu (1990). 

It is clear that these applications should benefit from the construction and perfect-simulation 
scheme presented here. As an illustration, we discuss in Sections H and P applications to binary 
autoregressions and to the Markov processes on the interval [0, 1] defined by Harris (1955) by 
mapping chains with complete connection into D-ary expansions. 

In this paper we rely on a regenerative construction of the chain, which generalizes, in some 
direction, those existing in the literature. This type of construction has been first introduced 
by Doeblin (1938) for Markov chains with countable alphabet. Schemes for more general state 
spaces came much later (Athreya and Ney, 1978; Nummelin, 1978). The first regenerative 
structures for chains with complete connections were proposed by Lalley (1986, 2000) and 
Berbee (1987) for chains with summable continuity rates. An explicit regenerative construction 
was put forward by Ferrari, Maass, Martinez and Ney (2000) in the spirit of Berbee's approach. 



In the present paper we take up the scheme of Ferrari, Maass, Martinez and Ney (2000), 
extend it to part of the Harris uniqueness regime and transcribe it as a perfect simulation 
algorithm. Basically, the construction used here can be interpreted as a simultaneous coupling 
of all histories, built in such a way that at each instant i there is a (random) number ki > 
such that the distribution of the move i + 1 is the same for all histories agreeing the ki preceding 
instants. This independence from the /cj-remote past yields the times r such that preceding 
histories are irrelevant for future moves. These are the regeneration times, defined by the 
conditions ki < i — t for all i > t. Both Berbee's and Lalley's constructions rely on the 
regeneration probability being positive, a fact that seems to hold only for summable continuity 
rates. In contrast, our construction extends to cases where (global) regeneration may have 
probability zero. Non-summable — but still not too slowly decreasing — rates inside the Harris 
uniqueness regime yield local regeneration times, that is, regenerations for finite time intervals 
(windows). In ergodic-theory terms, our construction is in fact a finitary coding of a process of 
i.i.d. uniform variables in the interval [0, 1]. 

Perfect simulation became popular after Propp and Wilson (1996) introduced the cou- 
pling from the past algorithm to simulate invariant measures of Markov chains. Wilson's page 
pittp : //dimacs . rutgers . edu/~dbwilson/ exact provides updated and extensive references on 



perfect simulation. Foss and Tweedie (1998) and Corcoran and Tweedie (1999) proposed a gen- 
eral framework based on regeneration schemes for Markov chains and the so-called "stochastic 
recursive sequences". These are processes defined by Xn+i = f{Xn,Cn), where ^ is a stationary 
process. In the Markovian case S,i are i.i.d.; in the non-Markovian case, it remains the matter 
of how to construct /simulate the sequence ^j. Our algorithm applies to a wide variety of non- 
Markovian processes and, through the formalism of random systems with complete connections, 
it can be used to simulate Markov processes with state space of large cardinality (e.g. the unit 
interval). Section ^ present an example along these lines. 

Our main result is an explicit construction, as a deterministic function of a sequence of i.i.d. 
random variables uniformly distributed in [0, 1] , of realizations of the stationary chain with 
infinite memory. As corollaries we get: (i) an alternative proof of the existence and uniqueness 
of the stationary process, and (ii) a perfect simulation algorithm and a regeneration scheme for 



this process. These results are summarized in Theorem 4.3 and Corollaries 4.12, 4.14 and 4.18. 
In Section |^ we introduce the basic definitions and in Section |3| we illustrate the simulation 
on a concrete example, which is continued in Section ^ The results of Section ^ are proved 
in Sections |5[|^ and ^, though the perfect simulation algorithm is described in Section in its 
general version. 



2 Definitions 

We denote by G our alphabet, N* = N \ {0} and -N* = {-i : i G N*}. In what follows G 
can be finite or countable, though in the latter case conditions ( [4.4D and ( [4.g| ) below impose 
severe limitations. The set G^^' is the space of histories; we write uj\ := {wh,Wh-i, . . . ,Wa) 
for — cxD < a < 6 < +oo. For shortness we write w = u:'^^. Let P : G x G^^* -^ [0, 1] be a 
probability transition kernel; that is, P{g\wZoo) > for all g and 

5]P(5klL) = l. (2.1) 

for each i^Zoo ^ G~^ . The kernel P defines, by telescopic products, what in statistical mechan- 
ics is called a specification. A specification is a consistent system of conditional probabilities, 
where consistency is required for all histories wZqo- ^^ standard probabilistic treatments, such 
requirements are made only almost surely with respect to some pre-established appropriate 
measure. But in the present setting the determination of the appropriate measure is part of the 
problem, and stronger requirements are a priori necessary. 

Denoting by ri{i),i € Z, the coordinate mappings on G , we say that a (non-necessarily 
stationary) probability measure v on G^ — or a process with distribution v — is compatible with 
the specification P if the latter is a version of the one-sided conditional probabilities of the 
former: 

'^{v ■■ ri{i) = g I r]{i + j) = Wj, j G -N*) = Pig\wZlo) (2-2) 

for all i £ 1i, g £ G and v-a.e. wZ^^ G G~ . Then the identities 

n 

v\rj : r]{i + k) = gk,k £[0,n] r]ii+ J) = Wj, j G -N*j = Y[P{gk\gk-i, ■ ■ ■ 9o,wZl^) , (2.3) 

A:=0 

hold for z^-a.e. wZ^o^ where the concatenation is defined by 

{Wk,Wk-l,...,Wi,zZlo) ■= iWk,Wk-l, . . . ,Wi, Z^i, Z-2, ■ ■ ■) ■ 

The regenerative construction of this paper is based on the functions 
ao{g) := mf[p{g\zZl) ■ zZl G G-""'} 

ak{9\wZl) ■■= in{[p{g\w^i,...,w.k,zZl) : zZIgG-""*} , k>l, (2.4) 

defined for k £ N, g,W-i, . . . ,w^k ^ G. [These functions are denoted g{io\i-i, . . . ,i-k) by 
Berbee (1987)]. The numbers 

Uk := ini^ "^ akig\wZl) , (2.5) 

^-fe g&G 



fc G N, determine a probabilistic threshold for memories limited to k preceding instants. The 
sequences ak{g\wZj^) and a^ are non-decreasing in k and contained in [0, 1]. 

For our construction we shall use a sequence U_ = (Ui : i G Z) of independent random vari- 
ables with uniform distribution in [0, 1[, constructed on the corresponding canonical probability 
space {Q,T,F). We denote E the expectation with respect to P. 



3 An example: binary autoregressive processes 

To motivate the method, we present an example that shows how to construct and perfect 
simulate a process with infinite memory. Let us consider binary autoregressive processes. Such 
a process is the binary version of autoregressive (long memory) processes used in statistics and 
econometrics. It describes binary responses when covariates are historical values of the process 
(McCullagh and Nelder, 1989, Sect. 4.3). 

Let the state space be G = {— 1,-|-1}, Oq a real number and {Ok',k > 1) a summable real 
sequence. Let q : M i-^]0, 1[ be strictly increasing and continuously differentiable. Assume that 



P{ ■ l^-oo) is the Bernoulli law on {—1, +1} with parameter qiOo + /^ Ok 



kW-k 



(3.1) 



fc>i 



i.e., P{+l\w-_lo) = qieo + Zk>i ^kW-k) = 1 - P{-l\wZl). 

By compactness there exists at least one process compatible with (p.l|); the conditions for 
uniqueness are well known. The question is how to simulate (construct) such a stationary 
process. To do that we construct a family of partitions of the interval [0, 1[ indexed by k and 



w 



~_j. using ak{l\w_j^) and a^ defined in the previous section. Letting 



rk 






we have 



aki^wj^^) = g' 6*0+ ^ 9mW-m - rk 



l<m<k 



and 



ak 



1 — sup < 



ak{-l\w_l) = 1-g 6*0+ ^ 9mW-m + rk 




l<m<k 



l<m<fc 



iW-m + rk 



+ E 



(3,2) 



mU!—m 



rk 



l<m<k 



(3.3) 



For each k and wZj^, let Bfc(5|it;~^) be intervals of length ak{g\wZf:) — afc-i(5|wZfc_|_i) placed 
consecutively in lexicographic order in k and g, starting at the origin. Use the convention 
aQ{g\wQ ) = 00(5) and a_i = 0. These intervals form a partition of the interval [0, 1[. 

Let {Ui : i £ Z) he the sequence of i.i.d. random variables uniformly distributed in [0, 1[ 
defined at the end of Section |2|. For n € Z define the random variable 

Kn:=J2'' ^{^n G [flfc-i, flfe)} . (3.4) 

fc>0 

In our construction the variable Kn := Kn{U) indicates how many sites in the past are needed 
to compute the state at time n. To each n associate an arrow going from n to n — Kn- The state 
at site n will be independent of the states at t < s if no arrow starting at {s, . . . ,n} finishes to 
the left of s. Let r[n] := T[n\{U) be the maximum of such s: 

T[n] := max< s < n : Uj < aj-s, j ^ [sjIt] > 

= max< s < n : Kj > s, j £ [s, n] > (3-5) 



Notice that T[n] is a stopping time for the sequence (Un-k '■ k > 0). We show in Theorem O 
that the condition 

^rfe<cx) (3.6) 

k 

is sufficient to guarantee P(r[n] > —00) = 1 which, in turn, is an equivalent condition to the 
feasibility of the following construction. 

Simulation (construction) of the stationary measure 

1. Generate successively i.i.d. uniform random variables Un,Un-i, ■ ■ ■■ Stop when [/,-[«] is 
generated. Using (^^), the values Kn, Kn-i, ■ ■ ■ , K^-^n] are simultaneously obtained. 

2. Use the [/,-[«]! ■ ■ ■ ,Un and -^'^[n], • • • , Kn generated in the previous step to define 

Xj = g if U,e(j B,ig\X^Zl) , (3.7) 

£=0 

recursively from j = r[n] to j = n. 

3. Return X„. The algorithm has also constructed X^^n]j ■ ■ ■ 1 ^n-i- 



The expression ( |3.7D is well defined because by the definition of T[n], j — Kj > T[n], Uj G 
[0, axj [C UggG U£=o ^iidl^j-e ) aiid the set in ( |3.7| ) depends at most on Xj^Kj , ■ ■ ■ , ^j-i- This 



is discussed in detail in Section y; see (|6.3|). We show in Theorem O that the above algorithm 
constructs a realization Xn of a random variable which has the one-coordinate marginal of the 
unique measure compatible with the specification. To construct a realization of the measure in 
a finite window, just repeat the algorithm for other n reusing always previously generated Uj. 
The algorithm induces a function <I> : [0, 1[ — > {—1, 1} . Given the event {T[n\ = A:}, X^ — the 
nth coordinate of <l>(t/^) — depends only on [7^, . . . , Un- (To be rigorous, one should give the 
definition of <& when T[n\ = -co for some n; this is an arbitrary but irrelevant choice as this set 
has probability zero under our hypotheses.) 

This construction exploits a well known fact. The existence of a renovating event gives rise to 
a perfect simulation algorithm and a regenerative structure. In our case there is a regeneration 
at time s if j — Kj > s for all j > s (no arrow passes over s). However for the construction of 
the measure in site n we use a weaker condition: it suffices that no arrow passes from j to the 
left of s for j G [s,n]. 

While in some cases one may have explicit expressions for a^, in general, this will not be the 
case (see (^.3|)). An useful aspect of our construction is that in these cases we can work with 
lower bounds a^. We shall discuss this issue in Section p. 



4 Results 

The existence of our regeneration scheme depends on the non-increasing sequence 

m 

f3m:=llak, (4.1) 

A:=0 

with afc defined in ( |2.5| ), and a necessary (but not sufficient) condition is limk^oo o-k = 1- The 
regeneration time for a window [s,t], for — oo < s < cx) and s < t < oo, is the random variable 

t[s, t] := max< m < s : Uk < ak-m , ^ ^ ["t-j t] > (4-2) 

which may be — oo. Notice that T[s,t] = min{r[n] : n E [s,t]} and that it is a stopping time for 
{Ut-k '■ A; > 0), in the sense that {t[s, t] > j} € J^{Ui : i G [j, t]) for j < s, where T{Ui : i G [j, t]) 
is the sigma field generated by (f/j : i G [j,t]). 
Our main result is: 

Theorem 4.3 // 

J^ /3m = oo (4.4) 

then 



(i) For each finite [s,t] C Z, P(T[s,i] > — oo) = \, where T[s,t\ is defined in 
(a) There exists a measurable function $ : [0, 1[ — > G — described in Section |g — such that 

fi := ¥{^{U) € •), (4.5) 

the law of^{U_), is compatible with P. Moreover, the distribution fi is stationary. 

(Hi) In addition, the function <I> has the property that for each finite interval [s,t] C Z, its 
restriction {^{U_)ii) '■ i € [s,t]} depends only on the values ofUi in the interval [T[s,t],t]. 
More precisely, for i e [s,t], 

^{U){i) =$(..., Vr-2, Vr-1, Ur, ..., Ut,Vt+l,. . .)« (4.6) 

for any sequence v G [0, Ip (we abbreviated T[s,t] as t). 
(iv) The law ofT[s,t\ satisfies the following bound: 

t-s 

¥{s -T[s,t]>m) < Y^ pm+i (4.7) 

i=0 

where pm is the probability of return to the origin at epoch m of the Markov chain on N 
starting at time zero at the origin with transition probabilities 

p{x,x + l) = a^ 

pix,0) = (1-a,) (4.8) 

and p{x,y) = otherwise. In particular, if (1 — a^) decreases exponentially fast with k, 
then so does p^. // (1 — a^) ~ k~'^ with 7 > 1 then p^ decreases with the same power. 

H If 

(3:= lim /J^ > (4.9) 

m— >oo 

then items (i), (Hi) and (iv) above hold also for t = 00. 
This theorem is proven in Sections ^ and |^. More detailed bounds on the parameters pm 



are given in Proposition 5.13. Conditions ([4.4| ) and (4.9) require oq > as in Harris (1955). 
Both conditions are noticeable weaker than those imposed by Lalley (1986), Berbee (1987) 
and Bressaud, Fernandez and Galves (1999b), as well as those corresponding to the 5- measure 
approach (Ledrappier, 1974) and to Gibbsian specifications (Kozlov, 1974). 



Remark 4.10 The construction can be performed replacing the a^'s with lower bounds, that 
is, considering a sequence a^ in ]0, 1[, such that 

al<ak for fc > . (4.11) 

The theorem is valid replacing the unstarred ak by starred ones, and using the corresponding 
starred versions of Pm, t and pm- While the actual a^s give shorter regeneration times, they 
could be hard to estimate. Suitable choices of a^ could provide a reasonable compromise between 
shorter regeneration times and feasible calculational efforts (Section P). 

Our first corollary is the uniqueness of the measure compatible with P. 

Corollary 4.12 (Loss of memory and uniqueness) 

(i) Every measure /i compatible with the specification ¥ has the following loss-of-memory 
property: If f is a function depending on the interval [s, t] and i < s, 

t-s 

IJ-l^f VU) = Wj, j < ij - nij r]{j) = Vj, j <ij < 2 ||/||oo "^Ps+j-i (4.13) 

j=0 

for every w,v € G . Here we use the notation pf := J p{dri)f{'q). 
(a) IfJ2m>of^^ ~ ^^ ^^^ measure p defined in ( \4-3{ ) is the unique measure compatible with P. 

The uniqueness result is not new. Under the more restrictive condition (^^) it was already 
obtained by Doeblin and Fortet (1937). Harris (1955) [see also Section 5.5 of losifescu and 
Grigorescu (1990) and references therein] extended this uniqueness to a region that coincides 
with ( [4.4p for two-symbol alphabets but it is larger for larger alphabets. Other uniqueness 
results, in smaller regions, were obtained by Ledrappier (1974) and Berbee (1987) in differ- 
ent ways. Results on loss of memory were also obtained by Doeblin and Fortet (1937) (see 
also losifescu, 1992), under the summability condition ( |4.9D . Bressaud, Fernandez and Galves 
(1999b) extended those to a region defined by a condition slightly stronger than ([4.4| ). The 
rates of loss of memory obtained in this last references strengthen those of Doeblin, Fortet and 
losifescu, but are weaker than ours. 

A corollary of (ii) and (iii) of the theorem is a perfect simulation scheme: 

Corollary 4.14 (Perfect simulation) Let P be a specification with X]^>o Pm = oo and p the 
unique measure compatible with P. For each finite window [s, t] there exist a family {^rn,u rn < s) 
of functions ^m,t ■ [0, 1[[™'*1^ G[™'*1 such that 

p{r, : ri{i)=g„ie[s,t]j = ^{u : ^r,t{Ur, ■ ■ ■ ,Ut){i) = g^, i e [s,t]^ (4.15) 

where r = T[s,t] is the stopping time defined in ( |^.^ . 

9 



Expression ( [4. 151) is our perfect simulation scheme for /x. Possible implementations are 
discussed in Section 0. Roughly speaking the algorithm goes as follows: 

1. Produce a realization of Ut, ... , Us, Us-i, . . . , U^-^g^iy, 

2. Compute ^r[s,t],t{^r[s,t], ■ ■ ■ , Ut). 

3. Return the configuration {^T[s,t],t{UT-[s^t]T ■ ■ ^Ut){i) : i S [s,t]). Its law is the projection 
of fi in the window [s,t]. 

Our algorithm shares two features with the famous Coupling From The Past (CFTP) algo- 
rithm for Markov chains, introduced by Propp and Wilson (1996): (1) The r.v. Ui are generated 
sequentially backwards in time, until the stopping time t[s, t] is discovered. (2) The algorithm is 
based on a coupled realization of the chain for all possible initial conditions ensuring the coales- 
cence of the different trajectories before the observation window. Restricted to the Markovian 
case, our renewal times are, in principle, larger than the coalescence times of well designed 
CFTP algorithms. Nevertheless, our approach, besides extending to non-Markovian processes, 
yields coupling times that depend only on the numbers a^ defined in (|2.5| ) [see ( 4.16| )], and hence 
that are only indirectly related to the cardinality of the alphabet. 



The bounds in (iv) of Theorem 4.3 can be used to control the user- impatience bias. This 
picturesque name proposed by Fill (1998) relates to the bias caused by practical constraints 
such as time limitation or computer storage capacity which force the user to stop a (long and 
unlucky) run before it reaches perfect equilibrium. Indeed, suppose that while sampling window 
[s, t] we decide to abort runs with regeneration times t[s, t] > M for some large M > 0, causing 
our algorithm to produce a biased distribution Jiffn. Applying Lemma 6.1 of Fill (1998) and 
(E^), we obtain: 



Z^ Pm+i 



sup 



-Af 



j=0 



f^{A) - /2-,](^) < -^^^ . (4.16) 

1=0 



In regime (v), Theorem ^^ yields the following regeneration scheme. Let N G {0, 1} be 
the random counting measure defined by 

N(j):=l{r[j,oo]=j}. (4.17) 

Let {Ti : i £ Z) he the ordered time events of N defined by N(i) = 1 if and only if i = T^ for 
some i, T£ < T^+i and Tq < < Ti. 

10 



Corollary 4.18 (Regeneration scheme) ///5 > 0, then the process N defined in ( U-lV is a 
stationary renewal process with renewal distribution 

P(T^+i -Te>m) = p^ (4.19) 

for m > and i ^ 0. Furthermore, the random vectors S,e G Dn>iG"', i ^TL, defined by 

UU.) ■■= mU){Ti),...,mr){Ti+i-l)) (4.20) 

are mutually independent and {^eiU.) • ^ 7^ 0) ^'"e identically distributed. 



5 Distribution of r[s, t] and r[s, oo 

In this section we prove items (i) and (iv) [and the corresponding parts in (v)] of the theorem. 
We fix an increasing sequence of numbers (a^) such that 

Ok y 1 as k y oo (5-1) 

and define the following house of cards family of chains ((1^™ : n > m) : ttt, G Z) by VF^ = 
and for n > m: 

Wr := (W^-i + 1) HUn < aw^i,} (5.2) 

where Un are the uniform random variables introduced in the previous section. Notation alert: 
Here W^ is not a vector of lenght n — m, but the position at time n of a chain starting at 
time m at the origin. Alternatively, ¥{W^ = y \ W^_i = x) = p{x, y), where the latter are the 
transition probabilities defined in item (iv) of Theorem |4.3|. Notice that 



Pk = P(M^r+fc = 0) (5.3) 

for all m G Z and /c G N*. The monotonicity of the a^'s implies that 

W^ > W^ for ah m < /c < n . (5.4) 

Hence, W^ = implies that iy„ = for m < k < n, and the chains coalesce at time n: 

W^ = =^ Wl^ = Wt\ m<k<n<t (5.5) 

(furthermore, W/" > W^ for t smaller than the smallest such n). 



11 



By the definition (|4.2| ) of r, we liave for j < s 



r[s,t] <j 



rm—l 

■ n 







Vm G [j, s],3n e [m, t] : W^ 

Vm € [j, s],3ne [s, t] : W^'^ = 

max{m < s : Vn G [s, t] , W™ > 0} < j - 1 

3n G [s, t] : TV'^"^ = 



(5.6) 



where the second Hne follows from the coalescing property ( [5.51 ), and the last line is a consequence 
of the second one and the monotonicity (5.4). From the third line we get 



T[s,t] 



1 + maxjm < s : W^ > 0, Vn G [s,t]\ . 



(5.7) 



Lemma 5.8 

a) Condition X]„>o nfe=i o-k = oo is equivalent to any of the two following properties: 

(a.l) The chain iW^ : n > m) is non positive- recurrent for each m. 
(a. 2) For each — cxd < s <t < oo, T[s,t] > — oo a.s.. 

h) Condition Hfcli Ofc > is equivalent to any of the two following properties: 

(h.l) (VF™ : n > m) is transient 

(h.2) For each —oo < s, t[s, oo] > -co a.s.. 



In both cases, inequality ( \4-'iV holds 



Proof, a) It is well known that Wn is positive-recurrent if and only if X^„>onfc=i '^k < oo, 
(for instance Dacunha-Castelle, Duflo and Genon-Catalot (1983), p. 62 ex. E.4.1). Also, by 



(5.9) 



m 



{r[s,t]<m}= U {Ty--i = 0} 

i€[s,t] 

for m < s. By translation invariance, the probability of the right-hand side of (5.9) satisfies 



P( U i^-m+i = 0}) e 

ie[s,t] 



t-s 



'{Wt-m = 0),Y.^(.Wl 



0) 



4 = 1 



(5.10) 



where the right-hand-side is a consequence of the monotonicity property ( pA\ j. As m — > — oo 
this interval remains bounded away from in the positive-recurrent case, but shrinks to 
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otherwise. Since P(r[s,i] = — oo) = liTam^-oo^{T[s,t] < m), this proves that (a.l) and (a.2) 
are equivalent. 

b) Clearly, 

oo 
P(W^ / 0, Vn > m) = JJai . (5.11) 

i=0 

This implies that the product of a^ is positive if and only if the house-of-cards process is 
transient. From ( |5.9| ) we have that 



F{T[s,oo]<m)=F( (J {W^ = 0}] (5.12) 

j£[s— m+l,oo] 

which goes to zero as m ^ — oo in the transient case only. Therefore (b.l) and (b.2) are 
equivalent. 

Inequality (g^) follows from (U) or {^J^, due to (^. D 



The following proposition is due to Bressaud, Fernandez and Galves (1999b). 

Proposition 5.13 Let a^ be a [0,l]-sequence increasing to one. Let pk be the probability of 
return to the origin at epoch k of {W^ : n> 0). 

(i) If E„>o HLi flfc = oo, then p„ -^ 0. 

(^^) VUkLi^^k > 0, then J2n>oPn < oo- 

(Hi) If (1 — a„) decreases exponentially then so does pn- 

(iv) 



TT Ofc > and suplimsupl- ^j < 1 imply pn = 0(1 — On) (5.14) 

._, i fc— »oo \ J- ^ki J 



Item (iv) can be applied, for instance, when a^ ~ 1 — (log n)n '^ for 7 > 1. Items (i) and 
(ii) are direct transcriptions of (a.l) and (b.2) of the previous lemma. 

Remark 5.15 The results of this section are also monotonic on the choice of sequence (a^): A 



sequence (a^), satisfying (5.1), with a^ < a^ for all k, yields chains W*^ with larger probability 



of visiting 0, and hence with larger regeneration intervals s — T*[s,t] and larger values of pj^. 



This is the content of Remark 4.10 
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Bndl 







Bi(ik:l^ 

I 



\ 



Bo(2) 



B,{2\wZl] 



^kil\wZl) 



Bki2\wZl] 





ao 










fli 













flfc 



Figure 1: Partition {^k{g\wZl) ■ g S G, A: G N} of [0, 1[ in the case G = {1, 2} 



6 Construction of <l> 



In this section we prove results needed to show (ii) and (iii) [and the corresponding part in (v)] 
of Theorem 4.3. The results hold for any sequence (a^) satisfying (4.11), but for notational 
simplicity we omit the superscript "*". For g G G let bo{g) := ao{g), and for k >1, 

higlwZl) ■■= ak{g\wzl) - ak-iig\wzl). 

For each wZlo ^ G~^* let {BkiglwZl) : 5 S G, A; G N} be a partition of [0, 1[ with the following 
properties: (i) for g £ G, k > 0, Bf^{g\wZf.) is an interval closed in the left extreme and open in 
the right one with Lebesgue measure |B/;(5r|ii;2^,)| = bk{g\wZi^); (ii) these intervals are disposed 
in increasing lexicographic order with respect to g and k in such a way that the left extreme of 
one interval coincides with the right extreme of the precedent: 



Bo(5i), Bo(52), • • ■,Bi{gi\w_^), Bi{g2\w_^), . . . ,62(51 |u;_2),B2(52|?i'_2)' • • • 

is a partition of [0,1[ into consecutive intervals increasingly arranged. This definition is illus- 
trated in Figure 1 in the case G = {1,2}. 
In particular we have 



(J Bfc(5l 



w 



-k> 



fc>0 



Y,\Bk{9\wZl)\ =P{g\wzl) 



(6.1) 



fc>0 
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and 

U [jBkig\wZl) = [0,1[ 

gGG k>0 

where all the unions above are disjoint. For A; > let 

BkiwZl) := U Bk{g\wZl). 

geG 



(6.2) 



By Q and (|4llD , we have 



[0,ak[ C UBf(ii;:i) 



1=0 



0, X]"'^'(5'k-fc) 



for all w_l^ € G" 



-N* 



As a consequence, 



a fact that makes the definitions 

Be^k{'w-i,...,w-k) 
Bt^kig\w-i,...,w-k) 



iov i> k 



[0,ak[nBeig\wZl) ■ 



(6.3) 
(6.4) 

(6.5) 
(6.6) 



meaningful for all £,k > 0. 

Items (ii) and (iii) [and the corresponding part in (v)] of Theorem |4.3| are immediate conse- 
quences of the following proposition. Similar to ( |3.5[ ), define 



''"N(m) •= max< s < n : Uj < a-j-s, j € [s,n] > . 



Proposition 6.7 Let go be an arbitrary point in G. Define the function $ : [0, 1[^^ G ,u i— > 
X = $(n), recursively from j = T[n]{u) to j = n in the following fashion: write r = T[n](u) and 
define 



gdG 
Cr+i := ^gVyar+i^BQ{g) \jBi^i{g\xr)^ 



Xn ■■= ^^glyjin^ IJ Bj^n~T{g\Xn-l,---,Xr)^ (6.8) 

gGG 0<i<r 

i/r[n](n) is finite, and x„ = g^ if tV'\{u) = — oo. (Each sum in \6.d^ reduces to a single term.) 
Then: 
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(i) $ is well defined and measurable. 

(a) For each n ^TL, the component ^{u){n) depends only on the values of Ui on the interval 
[T[n](u),n] if T[n]{u) > — oo, and on {ui : i < n) otherwise. 

(Hi) If^{T[n]{U_) > — cxd) = 1, for all n G Z, then the law fi of ^{U_) is compatible with P and 
fi is stationary. 

Proof. Let 

An = lue [0, if : T[n]{u) > -00} . (6.9) 

(i) On the set An, the consistency of the definition |6.8| follows from two facts: (1) By definition, 
T[n] < t[j] for j E [T[n],n], and (2) (|6.3|) shows that if the event {Un < a^} holds, then we 
only need to look at Xn-i, • • • , a^n-fe to obtain the value of Xn- These facts imply that for every 
k £ [t [n] , n] , the value of Xk computed using (|6.8| ) with k in place of n yields the same value as the 
one obtained as part of the recursive calculation ( |6.8| ) for x„. The J^{Ui : i < n)-measurability 
of Xn follows, then, from definition (|6.8|). As the sets A^ are T{Ui : i < n)-measurable, see 
(^I^), the maps x„ remain measurable after gluing. We conclude that $(u) = {xn',n € Z,) is 
well defined and measurable. 

(a) Immediate from the definition (now known to be consistent). 
(Hi) When An is true, ([6.4]) implies that Definition |6.8| amounts to 

Xn = J^ffljnn G U ^kigKzl)} . (6.10) 



geG 



k>0 



which implies 



Ui:i<n;An 



Un^ljB,{gmu)]':r-l) 



k>0 



Ui : i < n ; Aj 



(6.11) 



Since each ^{U_){j) is J^{Ui : i < j)-measurable, and each [/„ is independent of J^{Ui : i < n), ii 
An has full measure, then (|6.1lj) equals 



[jBJg mU)] 



]n— 1 
in-k 



k>0 

by ( |6.lD . In other words, 

pf$(C/)(n) 



fc>0 



k{g 



mu.)] 



n-l 
n—k 



p{g I mu.)] 



n-l 

00 



g 



Uj : i < n 



P g 



^{U){i) : i < n 



3.12) 



(6.13) 



As the right-hand side depends only on i^{U_){i),i < n), so does the left-hand side. Hence the 
law of $(C/)(n) given ('5(IZ)(i),i < n) itself is still given by P. Therefore the distribution ^ of 
^{U_) is compatible with P in the sense of ( |2.2D . It is stationary by construction. D 
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Remark 6.14 The previous argument shows, in particular, that if a^ < Uk for ah A; > 0, and 

T*[s,t]{u) is finite (in which case T[s,t] < r*[s,t] is finite, see Remark 5.15| ), then ^{u){j) = 
<l>*(n)(j) for ah j € [s,t]. (Each ^*{u){j) depends on a larger number j — t*[j] of preceding 
U's.) If Condition (U) holds for the coefficients P^ = OfcLo'^fc' both $* and $ have laws 
compatible with P. 



Proof of Corollary 4.12| . We represent the one-sided conditioned measure through a family 
of functions ^{-{Wji) : [0, 1]^ i-^ G^ corresponding to a history w G G^ frozen for times smaller 
than i € Z. Writing, for shortness, yj = ^{U_\l^ji){j)} we set yj = Wj for j < i and successively 
for n = z,i + 1, . . ., 



Y,9^{Une \jBk[g 



g&G 



fc>0 



Vn-l,--- ,yi,Wi-l,--- ,W. 



n—ki • • • j I • 



(6.15) 



Let /i be any probability measure on G compatible with P. From ( p. 3D and ( |6.1| ) we see 
that the law of ^(U\w_^ i) is a regular version of /U given rji^i = Wj-i, ryj_2 = ^i-2-, ■ ■ ■■ That is, 



/^(/ I ^(J) = nj„ j<i) = E[fmU\wZLi))] 



(6.16) 



for any continuous /. We now follow the classical arguments. 
Proof of (i): Let / be as stated. By (6.16) 

f^(f\vU) = w,,J<i)-f^(f\vU) = v,,J<i) = E[fmU\^-lo,i))-fmUKLi)) ■ 

(6.17) 
Since 

l{'^iU\w,i){n)^<^{U\v,i){n)\ < l{T[n] < i} (6.18) 

the absolute value of both terms in (|6.20|) is bounded above by 



2\\f\\^¥{T[s,t]<i). 



(6.19) 



To conclude, we use the bound (4.7). 

Proof of (ii): If fi and /u' are two measures on G compatible with P, 

W-f^'f\ = 
< 



Kdw_l,)fi(^f r]{j) = Wj, j <ij - fj.'{dv) /x'(^/ r/(j) = Vj, j < ij 
fJ.{dwZlo) fJ''idv) n(^f r]{j) = Wj, j < ij - fJ.(^f r?(j) = Vj, j < ij 



(6.20) 



which, by part (i), goes to zero as i — > — oo. D 
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Proof of Theorem 4.3 We finish this section showing how Theorem O follows from 
previous results. 

Lemma |5.8| proves (i) and (iv) and the corresponding part of (v) . The bounds mentioned at 
the end of (iv) are consequence of Proposition 5.13| . Proposition ^^ proves parts (ii) and (iii) 
and the corresponding part in (v). 



7 Perfect simulation 



In this section we prove Corollary ^.14 . The construction of the function ^^-.t relies on an 



alternative construction of the stopping time T[s,t]. Assume s < t < oo and define 

Z[s,t] := max{i^„ -n + s : ne[s,t]} >0 (7.1) 



where K^ = Kn{U_) is defined in (|3.4| ). .^[s,t] is the number of sites to the left of s we need to 
know to be able to construct $ in the interval [s,t]. Let Y-i := t + 1, Yq := s and for n > 1, 
inductively 

y„ := Yn-1 - Z[Yn^i,Yn^2 - I] (7.2) 

Then it is easy to see that 

T[s,t\ = lim Yn = max{y„ : Yn = Yn+i} a.s., (7.3) 

with the usual convention max0 = — oo. 

Construction of the perfect-simulation function ^^t 

1. Set Y_i = t + 1, Yq = s and iterate the following step until Yn = Yn-i'- 



• Generate Uy^, • • • , t/y^_i-i. Use (3^) to compute Ky,,, • • • , Ky^^-^ and ( [7J| ) and (7.2) 
to compute Y^+i. 

2. Let T = Yn 

3. Iterate the following procedure from k = t to k = t: 

• Define Xk using ( |6.8D 

4. Return {xj : j G [s,t]) = ^rA^r, ■ ■ ■ ,Ut){j) : j £ [s,t]). 
By definition of r, $,,*([/,, . . . , Ut){j) = ^{U){j), j G [s, t]. D 



Remark 7.4 The algorithm can be apphed for any choice of (a^, A; > 0) satisfying ( 4.11 ) and, 
in addition, 



En 



au = oo . 



m k=0 

The smaller the a^, the smaller the stopping times T*[s,t] of the resulting perfect-simulation 
scheme. Also the return probabilities p^ increase if the a^ increase, worsening the bound (4.16) 
on the user-impatience bias. 



8 Regeneration scheme 



In this section we prove Corollary 4.18. The stationarity of N follows immediately from the 
construction. Let 

/(i) :=p(n(-j) = 1IN(0) = i) (8.1) 



^.2) 



for j G N* . To see that N is a renewal process it is sufficient to show that 



n-l 



N{se) = l;i = l,...,n) = pl[f{s,+i 



se 



e=i 



for arbitrary integers si < • ■ ■ < Sfc. [From Poincare's inclusion-exclusion formula, a measure on 
{0, 1}^ is characterized by its value on cylinder sets of the form {( E {0, 1}^ : C(s) = 1, s G 5'} 
for all finite 5 C Z. For S = {si, . . . ,Sk}, a renewal process must satisfy ( ^.2]) .] For j G Z, 
f G ZU{oo}, define 



( {Uj+e<ai,£ = 0,...,j'-j}, if j</ 



H[J,f] ■■= I 



With this notation, 



and 



"full event" , 



N(i) = l{i?[i,oo]}, iGZ. 



if j > j' 



N(s,) = l;£ = l,...,n) = F{f]H[se,oo]'j 
From monotonicity we have for j < j' < j" < oo, 

H[j,j"]nH[j',j"] = H[j,j'-i]nH[f,n 



(8.3) 

(8.4) 
(8.5) 

(8.6) 
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and then, with Sn+i = oo we see that (^) equals 

n 

ll¥[H[se,se+i-l]] , (8.7) 

On the other hand, 

fij) = F{H[-j, oo] I H[0, oo]) = F{H[-j, -1]) (8.8) 

Hence, (^.7[) equals the right hand side of (W^ and we have proved that N is a renewal process. 



On the other hand, by stationarity, 

P(T^+i-r^ >m) = P('r[-l,oo] < -m+1 t[0,oo]=o) (8.9) 

and, hence, by ( |5.6| ) and ( |5^ ) 

FiTe+i -Te>m) = P(i^rr+' = 0) = Pm , (8.10) 

proving ( 4.19| ). 



The independence of the random vectors S,e follows from the definition of T^ and part (iii) of 
Theorem E^. n 



9 Applications 

9.1 Binary autoregressions, continued 



In this subsection we continue the discussion of example (3.1). Recalling the notations of Section 
1^, we define 



C+ = max{q'{x):xe[eo-Y,\^rn\,eo + Y,\'^rn\]} (9.1) 

m>0 m>0 

C- = mm{q'{x):x£[eo-Y,\Gm\,0o+Y.\Om\]} . (9.2) 



m>0 m>0 



From Definition (O), a simple computation shows that 

/? > ^^ ^ rfc < oo ^^ ^ A: I6lfcl < oo (9.3) 

k k 

and also that for |^fc| ~ Ck~'^, Condition ( |4.4| ) is satisfied for C < (2(7"*")"^, but not satisfied for 
C > (2C^)^^. Hence limsup^ A;^|^fc| < {2C'^)~^ is sufficient for being in the Harris regime ( [4. 4] ), 
and limsup^ |0fc|A:^ In k < oo implies that /3 > 0. 
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Since a^ given by (3^) has no simple expression in general, Remark 4.10| could be useful. 
Indeed, 

1 - 2C+rfc < Ofc < 1 - 2C^rk . (9.4) 

Under the extra condition that Ok i^ ^ for infinitely many /c's, we have in fact ak ~ 1 — 2C^rfc 
as /c — > oo. 

We can replace the coefficients o^ with the following lower bounds. We choose first some 
integer fco such that 2C^rkQ < 1 and define 

4 = afc A(l-2C+rA,.J for fc < /cq , 

(9.5) 

al = l- 2C+rfc for k > ko . 

We can use the modification of our algorithm at the end of Section with these coefficients a^. 
Note that we only need to compute at most ko different a^'s. 
We now discuss two well-studied choices for q. 

Sigmoid case: In addition we assume here that q is concave on M+ with q{x) + q{—x) = 
1, a; S M. One natural choice is 

qix) = ; — = z — , (9.6) 

^ ^ 2 cosh X 2(l + exp(-2a;)) ^ ' 

i.e., the so-called logistic function and logit model (Guyon (1995), Ex. 2.2.4), where the explica- 
tive variables are the values of the process in the past. For a general sigmoid q, the supremum 
in ( [3l^ ) is achieved for wZl minimizing \6o + ^i<m<k^mW-m\- 

Linear case: We take now q{x) = (1 + rc)/2, and necessarily |^o| + J2m>o l^nil < ^- ^^ ^^ 
will see, linearity makes the model ( ^.1| ) somehow trivial, but also instructive. Writing B{p) for 
the Bernoulli distribution with parameter p and 6 the Dirac measure, we note that the one-sided 
conditional law (^]^) is given by a convex combination 

o/i-M „f^ + ^o + Ek>i^kW-k\ M+^o-r-o , >r-,„ |Sign(^fc)tt;_fe + l 
P{-\w-^) = Bi ^ =B{ + ^10,1 

\ k>l 

= (l-ro)e((l + 0o-ro)/2(l-ro))+J]l0fcl<5sign{e,)«,_, , (9.7) 



k>l 



-1^ 



since B{Xp + (1 — X)p') = XB{p) + (1 — X)B{p') for A € [0, 1]. In this example we have ak{w_^j — 
1 — r^ = dk independent of w, and for A; > 1, 

, , 16*4.1 if t(;_fc = sign(6'fc) 

bu{±l\w-_l) = {ekW-k + M)/'! = { £{0,1^^1}. (9.8) 

otherwise 
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Hence one of the two intervals ^ki^^wZ^) is empty, while the other one ^k{sign{9k)w-k\wZf:) 
has length \6f^\. This is in accordance with formula ( |9.7D . 

In other respects, the decomposition ( |9.7D can be directly interpreted in terms of simulation: 
the value of the process at time i = is chosen according to a "new" coin tossing with probability 
1 — ro, and set to the value sign{6k)w-k with probability \6h\ (k = 1,2,...). In the latter 
case the value of w^k is needed, and will be constructed using ( |9.7] ) again, etc. . . Clearly this 
recursive construction will eventually stop if and only if \9q\ + X^,„>o l^'"l < ^- However, our 
construction in this paper requires the extra condition ( |4.4| ), which loosely speaking, amounts 
to limsup;;. A;^|0fc| < 1. The reason for stronger assumptions is that, in order to cover general 
processes, we need in our general construction to check all intermediate times between and 
t[0], though in the special case of P given by ( |9.7| ), it is not necessary to construct all of them 
following the above lines. 

9.2 Markov chains defined by D-ary expansions 

These are processes having the unit interval as "alphabet", / = [0,1], and defined through 
another, auxiliary, process with a finite alphabet. Formally, a family of maps is established 
between sequences of a finite alphabet G = {0, 1, • • • , -D — 1} and real numbers in / via D-ary 
expansions: For each n G Z 

Xn-.G^ -^ I 

(9.9) 
{r]{i):ieZ) ^ Xn = Y.T=iV{n - j)/D3 . 

This map induces a natural map from probability kernels P : G x G^^' i-^ [0, 1] to probability 
kernels F : / x / i-^ [0, 1]: For each x G /, given an wZ^o G G~^* with x = Xo{wZ '' 

f(x, = ^±^ 
V D 

Interest focuses on the existence and properties of measures on the Borelians of / compatible 
with such a (Markov) kernel F. 

Maps (p.g|)-( priO| ) have been already introduced by Borel (1909) for i.i.d. r]{i). The general 
case in which the ri{i) form a chain with long memory is the object of Harris (1955) seminal 
paper. They are the prototype of the random systems with complete connections mentioned 
in the Introduction. Harris determines conditions for the existence and uniqueness of these 
processes, through the study of long-memory chains: if the finite-alphabet chain satisfies a 
condition similar to (but weaker than) ([4.4|), there is a unique process Ai on / compatible 



-OOJ 



Xo = x) = P{g\w). (9.10) 



with F. This process A4 is of course a (stationary) Markov chain with transition probability 

22 



kernel F. Harris shows that its marginal distribution is continuous, except in the degenerate 
case with constant ?7(i)'s where it is concentrated on one of the points 0,1/(1) — 1),2/{D — 
1),---,1. Furthermore, if the process is mixing and not degenerate, this marginal is purely 
singular whenever the variables r](i) are not independent uniformly distributed (in which case 
the marginal is uniform). 

Our approach yields, in a straightforward way, a perfect-simulation scheme for the measures 



A4 obtained in this fashion, if the auxiliary process ri(i) satisfies condition (4.4). Indeed, the 

map 

X-.G-^' -^ I 

(9.11) 

can be made bijective by fixing rules to decide between sequences which are eventually and 
those that are eventually D — 1. In turns, this map induces a bijection between the sigma 
algebra S(, formed by unions of intervals with endpoints in multiples of D^ , and the subsets of 
(^[-^.-i]. We conclude that, if /i and A4 are the processes compatible with the kernels P and F 
related as in (|9.10| ), then the restriction of J\A to S^ can be perfectly simulated by mapping, via 
X, the perfect samples of the measure ^ on the window [— ^, —1], obtained by the algorithm of 
Section 0. We point out that the union of the families of M.£ uniquely determines the measure 
M. (it forms a so-called tt system). 

Conclusion For Markov chains in general state-space with transition kernel -P(x, • ) satis- 
fying the Doob's condition P(x, • ) > (3ip{ • ), for all state x, some /3 > and a measure ^ on 
the state-space, the forward coupling is well known. The corresponding coupling-from-the-past 
algorithm is illustrated in Example 2 of Foss and Tweedy (1999) and in Corcoran and Tweedie 
(1999). Notice however that the mere existence of a minorization measure is not sufficient to 
construct the couplings: one needs to explicitely know ip and (3. In this section we have discussed 
an example of a Markov chain with state-space [0, 1] that can be transcribed as a chain with 
complete connections and state-space {0, 1, ... D — 1}. Perfect simulating the latter provides 
a perfect simulation for the former. Instead of the exibition of a minorization measure for the 
Markov chain our method requires the knowledge of ak{g\wZ^) and lower bounds of a^ for the 
related infinite-memory chain. 
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